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Abstract. The Pattern self-Assembly Tile set Synthesis (PATS) problem is to determine a set 
of coloured tiles that self-assemble to implement a given rectangular colour pattern. We give an 
exhaustive branch-and-bound algorithm to find tile sets of minimum cardinality for the PATS 
problem. Our algorithm makes use of a search tree in the lattice of partitions of the ambient 
rectangular grid, and an efficient bounding function to prune this search tree. Empirical data 
on the performance of the algorithm shows that it compares favourably to previously presented 
heuristic solutions to the problem. 

1 Introduction 
1.1 Background 

An appealing methodology for bottom-up manufacturing of nanoscale structures and devices is 
to use a self-assembling system of DNA tiles to build a scaffold structure on which functional 



units are deposited [5|, [9|, [lj]. A systematic approach to the design of self-assembling DNA 
scaffold structures was proposed and experimentally validated by Park et al. in jjj. However, as 
pointed out by Ma & Lombardi in [6], that design is wasteful of tile types, i.e. generally speaking 
the same scaffold structures can be assembled also from fewer types of specially-manufactured 
DNA complexes, thus reducing the requisite laboratory work. 

Ma & Lombardi [f| formulated the task of minimizing the number of DNA tile types re- 
quired to implement a given 2-D pattern abstractly as a combinatorial optimization problem, 
the Patterned self-Assembly Tile set Synthesis (PATS) problem, and proposed two greedy 
heuristics for solving it. In this paper, we present a systematic branch-and-bound approach to 
exploring the space of feasible PATS tilings, and assess its computational performance also ex- 
perimentally. The method compares favourably to the heuristics proposed by Ma & Lombardi, 
finding noticeably smaller or even provably minimal tile sets in a reasonable amount of compu- 
tation time. However, as the experimental results in Section 2] show, the computational problem 
still remains quite challenging for large patterns. 

1.2 Overview 

Our considerations take place in the abstract Tile Assembly Model (aTAM) of Winfree 
and Rothemund 

0,1 221, LL3|]. The key features of this model are as follows. The basic building 
blocks to construct different two-dimensional shapes are called tiles. A tile is a non-rotating 
unit square that has different kinds of glues of varying strengths associated with each of its 
four edges. Only finitely many different combinations of glues are allowed, that is, we consider 
only finitely many glue types; we have an unlimited supply of tiles, however. In the aTAM, 
tile assemblies are constructed through the process of self-assembly. Self-assembly begins with 
a seed assembly, an initial tile assembly, already in place in the discrete grid Z x Z. A new tile 
can extend an existing assembly by binding itself into a suitable position in the grid: we require 



a new tile to interact with existing tiles with sufficient strength to overcome a certain universal 
temperature threshold. A more detailed account of the aTAM model is given in section I2TT1 

In the PATS problem [fj], one associates a colour with each tile type and targets a specific 
coloured pattern within a rectangular assembly. The question is: given the desired colour 
pattern, what is the smallest set of (coloured) tile types that will self-assemble to implement it? 
The specifics of the PATS problem are given in section 12.21 

Our definition of the PATS problem restricts the self-assembly process to proceed in a 
uniform way. This simplification allows us to design efficient strategies for an exhaustive search. 
In section [3] we give full particulars of a novel branch-and-bound (B&B) algorithm for the PATS 
problem. For a pattern of size m x n, we reduce the problem of finding a minimal tile set to the 
problem of finding a minimum-size constructible partition of [m] x [n]. Here, constructibility 
of a partition can be verified in time polynomial in m and n. This leads us to construct a search 
tree in the lattice of partitions of the set [m] x [n] and to find pruning strategies for this search 
tree. In the concluding sections H] and [5] we give some performance data on the B&B algorithm 
and summarize our contributions. 



2 Preliminaries 

2.1 The abstract tile assembly model 

Our notation is derived from those of 0, H 0. First, to simplify our notations, let T> 



{N, E, S,W} be the set of four functions Z 2 -*■ Z 2 corresponding to the cardinal directions 
(north, east, south, west) so that N(x, y) = (x,y+l), E(x, y) = (x + 1, y), S = N^ 1 and W = E^ 1 . 

Let £ be a set of glue types and s ■ U x U -*■ N a glue strength function such that 
s(<7i,<72) = s(o"2)0"l) for all <7i,o"2 e £■ In this paper, we only consider glue strength functions 
for which s(<ti,o - 2) - if <J\ t 02 • A tile type t e S 4 is a quadruple (ajsf(t),aE(t),as(t),aw(t)) 
of glue types for each side of a unit square. Given a set U of glues, an assembly A is a partial 
mapping from Z 2 to U 4 . A tile assembly system (TAS) ET = (T,S,s,t) consists of a finite 
set T of tile types, an assembly S called the seed assembly, a glue strength function s and a 
temperature r e Z + (we use r = 2). 

To formalize the self-assembly process, we first fix a TAS 3~ = (T,S, s, r). For two assemblies 
A and A' we write A A' if there exists a pair (x,y) e Z 2 and a tile t € T such that 
A' - Au {((x,y),t)}, where the union is disjoint, and 

^( ffD (i), fffl -i(i(D(i,i,))) > r , (1) 
D 

where D ranges over those directions in T> for which A(D(x,y)) is defined. This is to say that 
a new tile can be adjoined to an assembly A if the new tile shares a common boundary with 
tiles that bind it into place with total strength at least r. 

Let -*■*$■ be the reflexive transitive closure of A TAS & produces an assembly A if A is 
an extension of the seed assembly S, that is if S A. Let us denote by Prod SF the set of all 
assemblies produced by 57 ' . This way, the pair (Prod 3*, -*■*<?) forms a partially ordered set. We 
say that a TAS 3F is deterministic if for any assembly A e Prod^ and for every (x,y) e Z 2 
there exists at most one t e T such that A can be extended with t at position (x,y). A TAS S? 
is deterministic precisely when Prod S? is a lattice. Also, the maximal elements in Prod 2f are 
such assemblies A that can not be further extended, that is, there do not exist assemblies A' 
such that A -*■<? A'. These maximal elements are called terminal assemblies. We denote by 
Term 3* the set of terminal assemblies of ST . If all assembly sequences 

S A\ A 2 ^sr ■■■ (2) 



terminate and Term S? = {V} for some assembly V, we say that 5* uniquely produces V . 
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(a) (b) (c) 

Fig. 1. (a) A finite subset of the discrete Sierpinski triangle. This 2-colouring of the set [7] x [7] defines an 
instance of the PATS problem, (b) Assembling the Sierpinski pattern with a TAS that has an appropriate seed 
assembly and a (coloured) tile set shown in (c). 



2.2 The PATS problem 

In this paper we restrict our attention to designing minimal tile assembly systems that construct 
a given pattern in a finite rectangular m by n grid [to] x [n] c Z 2 . This problem was first discussed 
by Ma & Lombardi 0]. 

A mapping from [to] x [n] onto [k] defines a fc-colouring or a fc-coloured pattern. To 
build a given pattern, we start with boundary tiles in place for the west and south borders of 
the to by n rectangle and keep extending this assembly by tiles with strength- 1 glues. 

Definition 1 (Pattern self- Assembly Tile set Synthesis (PATS) 0). 

Given: A k-colouring c ■ [to] x [n] [k]. 
Find: A tile assembly system 3~ = (T, S,s,2) such that 

PI. The tiles in T have bonding strength 1. 

P2. The domain of S is [0,to] x {0} u {0} x [0,n] and all the terminal 
assemblies have the domain [0,to] x [0,n]. 

P3. There exists a colouring d : T -*■ [k] such that for each terminal assem- 
bly At Term 2? we have d(A(x,y)) = c(x,y) for all (x,y) e [m] x [n]. 



In particular, we are interested in the minimal solutions (in terms of |T|) to the PATS 
problem. By the same token, we can make the following assumption: 

Assumption 1. In our TASs, every tile participates in assembling some terminal assembly. 

Ma & Lombardi show a certain derivative of the above optimization problem N P-hard in [3j • 
However, to our knowledge, a proof of the NP-hardness of the PATS problem as stated above 
is lacking. 

As an illustration, we construct a part of the Sierpinski triangle with a 4-tile TAS in Figure 
[TJ We use natural numbers as glue labels in our figures. 

In the literature, the seed assembly of a TAS is often taken to be a single seed tile [l|, [T2] 
whereas we consider an L-shaped seed assembly. These boundaries can always be self-assembled 



using m + n + 1 different tiles with strength-2 glues, but for practical purposes we allow for 
the possibility of using, for example, DNA origami techniques [3] to construct these boundary 
conditions. 

Due to constraint PI the self-assembly process proceeds in a uniform manner directed from 
south-west to north-east. This paves the way for a simple characterization of deterministic TASs 
in the context of the PATS problem. 

Proposition 1. Solutions 3F = (T, S,s,2) of the PATS problem are deterministic precisely 
when for each pair of glue types (o"i,o"2) e there is at most one tile type t e T so that 
o~s(t) = °~i an d o-\y(t) = C2- 

A simple observation reduces the work needed in finding minimal solutions of the PATS 
problem. 

Lemma 1. The minimal solutions of the PATS problem are deterministic TASs. 

Proof. For the sake of contradiction, suppose that jV = (T, S,s,2) is a minimal solution to a 
PATS problem instance and that jV is not deterministic. By the above proposition let tiles 
t\,t% e T be such that crg(ii) = erg (£2) and o-w(ti) = o~w(t2)- Consider the simplified TAS 
Jf' = (T \ {£2}, <S, s, 2). We show that this, too, is a solution to the PATS problem, which 
violates the minimality of \T\. 

Suppose A e Teim^yV'. If A i Term^, then some teT can be used to extend A in Jf . If 
t e T \ {£2}, then t could be used to extend A in ,yV' , so we must have t = £2- But since new tiles 
are always attached by binding to south and west sides of the tile, A could then be extended 
by t\ in JV' . Thus, we conclude that A e Term^Y and furthermore Term^/K' E Term./K. This 
demonstrates that JV' has property P2. The properties PI and P3 can be readily seen to hold 
for r yV' as well. In terms of \T\ we have found a more optimal solution — and a contradiction. □ 

Assumption 2. We consider only deterministic TASs in the sequel. 
3 A branch-and-bound algorithm 

We describe an exact algorithm to find minimal solutions to the PATS problem. We extend the 
methods of [ol to obtain an exhaustive branch-and-bound (B&B) algorithm. The idea of Ma 
& Lombardi [g] (following experimental work of 0j) is to start with an initial tile set that 
consists of m ■ n different tiles, one for each of the grid positions in [m] x [n]. Their algorithm 
then proceeds to merge tile types in order to minimize \T\. We formalize this search process as 
an exhaustive search in the set of all partitions of the set [m] x [n] . In the following, we let a 
PATS instance be given by a fixed fe-coloured pattern c : [m] x [n] -> [k]. 

3.1 The search space 

Let X be the set of partitions of the set [to] x [n]. For partitions P,P' e X we define a relation 
E so that 

P^P' <^ Vp' eP' : 3p 6 P : p' c p . (3) 

Now, (X, e) is a partially ordered set, and in fact, a lattice. If P E P' we say that P' is a 
refinement of P, or that P is coarser than P' . Note that P E P' implies \P\ < \P'\. 

The colouring c induces a partition P(c) = {c _1 ({i}) | i e [k]} of the set [to] x [n]. In 
addition, since every (deterministic) solution S? = (T, S,s,2) of the PATS problem uniquely 
produces some assembly A, we associate with 2? a partition P(^) = {^4 _1 ({i}) | t e ^4([m] x 
[n])}. Here, |P(=^)| = \T\ due to our Assumptions 1 and 2. With this terminology, the condition 
P3 in the definition of the PATS problem is equivalent to requiring that a TAS ^ satisfies 



P(c)eP(J) 



(4) 




(a) (b) 

Fig. 2. (a) Partition A. (b) A partition M that is a refinement of A with \M\ = 7 parts. 

We say that a partition P € X is constructible if P = P(^) for some deterministic TAS 
with properties Pi and PS. With this, we can rephrase our goal from the point of view of 
using partitions as the fundamental search space. 

Proposition 2. A minimal solution to the PATS problem corresponds to a partition P e X 
such that P is constructible, P(c) E P and \P\ is minimal. 

For example, the 2-coloured pattern in Figure [2^i defines a 2-part partition, A, say. The 
7-part partition M in Figure [2b is a refinement of A {A E M) and in fact, M is constructible 
(see Figure [3b) and corresponds to a minimal solution of the PATS problem defined by the 
pattern A. 

3.2 Determining constructibility 

In this section we give an algorithm for deciding the constructibility of a given partition in 
polynomial time. To do this, we use the concept of most general (or least constraining) tile 
assignments. For simplicity, we assume the set of glue labels £ to be infinite. 

Definition 2. Given a partition P of the set [to] x [n], a most general tile assignment 

(MGTA) is a function f : P -*■ U 4 such that 

Al. When every position in [to] x [n] is assigned a tile type according to f , any two adjacent 

positions agree on the glue type of the side between them. 
A 2. For all assignments g : P -*■ U 4 satisfying Al we hav$\ 

/(Pl)Di = /(P2)o 2 =^ gip^D! = g(P2)D 2 (5) 
for all ( P i,D 1 ),(p 2 ,D 2 )tPxV. 

To demonstrate this concept we present a most general tile assignment /:/->■ Z" 4 for the 
initial partition I - {{a} \ a € [to] x [n]} in Figured and a MGTA for the partition of Figure 
[2b in Figure [3b. 



1 To shorten the notation we write f{p)o instead of ao(f(p))- 
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Fig. 3. (a) A MGTA for the constructible initial partition / (with a seed assembly in place), (b) Finished assembly 
for the pattern from Figure [2^,. The tile set to construct this assembly is given in (c). 



Given a partition P e X and a function / : P -*■ E 4 , we say that g ■ P 
f by merging glues a and b if for all (p,D) e P x D we have 



9(p)d = 



f(p)o, 



if f(p)D = b 
otherwise 



U is obtained from 



(6) 



A most general tile assignment for a partition P e X can be found as follows. We start with 
a function /o : P -> I7 4 that assigns to each tile edge a unique glue type, or in other words, a 
function /o so that the mapping (p,D) h» /o(p)_d is injective. Next, we go through all pairs of 
adjacent positions in [m] x [n] (in some order) and require their matching sides to have the same 
glue type by merging the corresponding glues. This process generates a sequence of functions 
/o) fi > Hi ■ ■ ■ i In = f and terminates after N < Iran steps. 

Lemma 2. The above algorithm generates a most general tile assignment. 

Proof. By the end, we are left with a function / that satisfies property Al by construction. To 
see why property A2 is satisfied, we again use the language of partitions. 

Any assignment gives rise to a set of equivalence classes (or a partition) on P x D: Elements 
that are assigned the same glue type reside in the same equivalence class. The initial assign- 
ment /o gives each part-direction pair a unique glue type, and thus, corresponds to the initial 
partition J = {{a} \ a e P x T>}. In the algorithm, any glue merging operation corresponds to 
the combination of two equivalence classes. 

The algorithm goes through a list of pairs {{aijbi}}^ 1 of elements from P x T> that are 
required to have the same glue type. In this way, the list records necessary conditions for property 
Al to hold. This is to say that every assignment satisfying Al has to correspond to a partition 
that is coarser than each of the partitions in C = {J[ai, h]}^ 1 , where J[a,b] is the partition 
obtained from the initial partition by combining parts a and b. Since the set (P x T>, e) is a 
lattice, there exists a unique greatest lower bound inf C of the partitions in C This is exactly 
the partition that the algorithm calculates in the form of the assignment /. As a greatest lower 
bound, inf£ is finer than any partition corresponding to an assignment satisfying Al, but this 
is precisely the requirement for condition A 2. □ 



The above analysis also gives the following. 

Corollary 1. For a given partition, MGTAs are unique up to relabeling of the glue types. 

Thus, for each partition P, we take the MGTA for P to be some canonical representative 
from the class of MGTAs for P. 

For efficiency purposes, it is worth mentioning that MGTAs can be generated iteratively: 
A partition P e X can be obtained by repeatedly combining parts starting from the initial 
partition I: 

I = P 1 5P 2 5-lP N = P . (7) 

As a base case, a MGTA for I can be computed by the above algorithm. A MGTA for each Pj+i 
can be computed from a MGTA for the previous partition Pi by just a small modification: Let 
a MGTA /j : Pj -> U 4 be given for Pj and suppose Pj+i is obtained from Pi by combining parts 
Pi,P2 e Pi- Now, a MGTA /j+i for P- l+ i can be obtained from fi by merging tiles fi{p\) and 
fi{P2)-, that is, merging the glue types on the four corresponding sides. 

We now give the conditions for a partition to be constructible in terms of MGTAs. 

Lemma 3. A partition P e X is constructible iff the MGTA f : P -*■ U 4 for P is infective and 
the tile set f{P) is deterministic in the sense of Proposition^ 

Proof. "=>" : Let Pel be constructible and let the MGTA / : P U 4 for P be given. Let 2T 
be a deterministic TAS such that P{2?) = P. The uniquely produced assembly of 3* induces a 
tile assignment g : P -> U 4 that satisfies property Al. Now using property A2 for the MGTA / 
we see that any violation of the injectivity of / or any violation of the determinism of the tile 
set /(P) would imply such violations for g. But since g corresponds to a constructible partition, 
no violations can occur for g and thus none for /. 

"<=": Let / : P -> U 4 be an injective MGTA with deterministic tile set f(P). Because f(P) 
is deterministic, we can choose glue types for a seed assembly S so that the westernmost and 
southernmost tiles fall into place according to / in the self-assembly process. The TAS ST = 
(/(P), 5, s, 2), with appropriate glue strengths s, then uniquely produces a terminal assembly 
that agrees with / on [m] x [re]. This gives P(3?) E P, but since / is injective, |P| = |/(P)| = 
\P(3T)\ and so P(V) = P. □ 

3.3 An initial search DAG 

Our algorithm performs an exhaustive search in the lattice (X, e) searching for constructible 
partitions. In the search, we maintain and incrementally update MGTAs for every partition we 
visit. First, we describe simple branching rules to obtain a rooted directed acyclic graph search 
structure and later give rules to prune this DAG to a node-disjoint search tree. 

The root of the DAG is taken to be the initial partition / that is always constructible. For 
each partition P e X we next define the set C(P) £ X of children of P. Our algorithm always 
proceeds by combining parts of the partition currently being visited, so for each P' e C(P) we 
will have P' E P. Say we visit a partition P € X. We have two possibilities: 

CI. P is constructible: 

1. If P is not a refinement of the target pattern P(c), that is if P(c) ^ P, we can drop 
this branch of the search, since no possible descendant P' E P can be a refinement of 
P(c) either, (i.e. C(P) = 0) 

2. In case P(c) E P, we can use the MGTA for P to give a concrete solution to the PATS 
problem instance defined by the colouring c. To continue the search and to find more 
optimal solutions we consider each pair of parts {pi,P2} £ P in turn and recursively 
visit the partition P[pi,P2] where the two parts are combined. In fact, by the above 
analysis, it is sufficient to consider only pairs of the same colour: 



C(P) = {P[pi,p 2 ] |pi,p 2 eP, pi*p 2 , 3k€P(c):p 1 ,p 2 ^k} . (8) 



C2. P is not constructible: In this case the MGTA / for P gives f(pi)s = f{P2)s an d 
f(j>i)w = f{jP2)w f° r some parts p\ + p 2 . We continue the search from partition P[pi,p 2 ]- 



C(P) = {P[ Pl ,p 2 ]} . (9) 

To guarantee that our algorithm finds the optimal solution in the case C2 above, we need 
the following. 

Lemma 4. Let P e X be a non- constructible partition, f the MGTA for P and pi,p2 e P, 
Pi ± P2, parts such that f(pi)s = f(P2)s and f(pi)w = f(P2)w- For all constructible C E P we 
have C E P[p\,p 2 ]. 

Proof. Let P, f, p\ and p 2 be as in the statement of the lemma. Let C E P be a constructible 
partition and g : C -> £ 4 the MGTA for C. Since C is coarser than P we can obtain from g 
a tile assignment </ : P -> Z 4 such that </(p) = g(<?), where g e C is the unique part for which 
p£ q. The assignment g' has property Al and so using ^42 for the MGTA / we get that 

f(pi)s = f(j>2)s & f(pi)w = f(j>2)w =^ g'(pi)s = g'{P2)s & g'{pi)w = g'{p2)w ■ 

(io) 

Now, since C is constructible, the identities g(qi)s = g(°2)s an( i ^(^l)^/ = g{ a 2)w can n °t hold 
for any two different parts (71,(72 £ C. Looking at the definition of g', we conclude that p\ £ q 
and p 2 £q for some (7 € C. This demonstrates C E P[pi,p 2 ]- □ 

3.4 Pruning the DAG to a search tree 

Computational resources should be saved by not visiting any partition twice. To keep the 
branches in our search structure node-disjoint, we maintain a list of graphs that store restrictions 
on the choices the search can make. 

For each partition P 3 P(c) we associate a family of undirected graphs {G^j^p^, one for 
each colour region of the pattern P(c). Every part in P is represented by a vertex in the graph 
corresponding to the colour of the part. More formally, the vertex set V(G?) is taken to be 
those parts p e P for which p £ k. (So now, Ufc e p( c ) ^(^0 = ^0 ^ n e dge {pi,p 2 } £ E(G^) 
indicates that the parts p\ and p 2 are not allowed ever to be combined in the search branch in 
question. When we start our search with the initial partition /, the edge sets are initially empty, 
E(G I k ) = 0. At each partition P, the graphs {G^} k£P ^ have been determined inductively and 
the graphs for those children P' € C{P) that we visit are defined as follows. 

Dl. If P is constructible: We choose some ordering {pi,qi}, i = 1,...,N, for similarly 
coloured pairs of parts. Define k e P(c), 1 < % < N to be the colour of the pair {pi,qi}, so 
that Pi,qi £ h- Now, we visit a partition P[pi,qi] if and only if {pi,qi} £ E{Gf ). When we 
decide to visit a child partition P' = P[pj,qj], we define the edge sets {E(G k ')}k £ p^ as 
follows: 

1. We start with the graphs {G k } keP ^ and add the edges {pi,qi} for all 1 < i < j to their 
corresponding graphs. Call the resulting graphs {G* k } keP ^ c y 

2. Finally, as we combine the parts pj and qj to obtain the partition P[pj,qj], we merge 
the vertices pj and qj in the graph G^. (After merging, the neighbourhood of the new 
vertex pj u q,j is the union of the neighbourhoods for pj and qj in Gj*). The graphs 

{G k '} keP ( c ) follow as a result. 
D2. If P is not constructible: Here, the MGTA for P suggests a single child partition 
P' - P[pi,p 2 ] for some p\,p 2 £ I e P(c). If {pi,p 2 } e E(Gf), we terminate this branch 
of the search. Otherwise, we define the graphs {G k }keP(c) to be the graphs {G k }^P{c)^ 
except that in Gf the vertices p\ and p 2 have to be merged. 



I 



Fig. 4. The search tree in the lattice (X, !=). We start with the initial partition I of size \I\ = mn. The partition 
-P(c) defines the PATS problem instance: We search for constructible partitions (drawn as crosses) in the sublattice 
(shaded with darker grey) consisting of those partitions that are refinements of -P(c). The search tree branches 
only at the constructible partitions and the tree branches are node-disjoint. 

One can see that the outcome of this pruning process is a search tree that has node-disjoint 
branches and one in which every possible constructible partition is still guaranteed to be found. 
Figure H] presents a sketch of the search tree. 

Note that we are not usually interested in finding every constructible partition P € X, but 
only in finding a minimal one (in terms of |P|). Next, we give an efficient method to lower-bound 
the partition sizes of a given search branch. 

3.5 The bounding function 

Given a root P e X of some subtree of the search tree, we ask: What is the smallest partition 
that can be found from this subtree? The nodes in the subtree rooted at P consists of those 
partitions P' E P that can be obtained from P by merging pairs of parts that are not forbidden 
by the graphs {G^} keP ^ c y This merging process halts precisely when all the graphs {G k }keP( c ) 
have beed reduced into cliques. As is well known, the size of the smallest clique that a graph G 
can be turned into by merging non-adjacent vertices is given by the chromatic number x(G) 
of the graph G. This immediately gives the following. 

Proposition 3. For every P' E P in the subtree rooted at P and constrained by {G^}fceP(c)j 



Determining the chromatic number of an arbitrary graph is an NP-hard problem. Fortu- 
nately, we can restrict our graphs to be of a special form: graphs that consist only of a clique 
and some isolated vertices. For these graphs, the chromatic numbers are given by the sizes of 
the cliques. 



we have 




fceP(c) 



To see how to maintain graphs in this form, consider as a base case the initial partition /. 
Here, E^G^) = for all k e -P(c), so G l k is of our special form — it has a clique of size 1. For a 
general partition P, we go through the branching rules D1-D2. 

Dl: P is constructible: Since we are allowed to choose an arbitrary ordering {pi,qi}, i = 
1, . . . ,N, for the children P[pi, qi\, we design an ordering that preserves the special form 
of the graphs. For a graph G of our special form, let K(G) £ V(G) consist of those vertices 
that are part of the clique in G. In the algorithm, we first set = G^ for all k e P(c) 
and repeat the following process until every graph is a complete clique. 

1. Pick some colour k e P(c) and an isolated vertex v e V{Hk) \ K(Hk)- 

2. Process the pairs {v,u} for all u e K(Hk) in some order. By the end, update 
to include all the edges {v,u} that were just processed (the size of the clique in 
increases by one). 

A moment's inspection reveals that when the graphs G^ are of our special form, so are 
all of the derived graphs passed on to the children of P. 
D2: P is not constructible: If the algorithm decides to continue the search from a partition 
P' = P[pi,P2], for some pi,P2 £ I e P(c), we have {pi,P2} i E(Gf). This means that either 
PI1P2 e V(Gf) n K(G[), in which case we are merging two isolated vertices, or one of p\ 
or P2 is part of the clique K(Gf), in which case we merge an isolated vertex to the clique. 
In both cases, we maintain the special form in the graphs {G^ }keP(c)- 

3.6 Traversing the search tree 

When running a B&B algorithm we maintain a "current best solution" discovered so far as a 
global variable. This solution gives an upper bound for the minimal value of the tile set size 
and can be used to prune such search branches that are guaranteed (by the bounding function) 
to only yield solutions worse than the current best. There are two general strategies to traverse 
a B&B search tree: Depth-First Search and Best-First Search [2|. Our description of the 
search tree for the lattice X is general enough to allow either of these strategies to be used in 
an actual implementation of the algorithm. 

In the next section we give performance data on our DFS implementation of the B&B 
algorithm. 

4 Results 

The running time of our B&B algorithm is proportional — up to a polynomial factor — to the 
number of partitions the algorithm visits. Hence, we measure the running time in terms of the 
number of merge operations performed in the search. Figure [5^ presents the running time of 
the algorithm to find a minimal solution for random 2-coloured instances of the PATS problem. 
The algorithm was executed for instance sizes 2 x 2, 2 x 3, 3 x 3, 5 x 6 and 6x6; the 20th and 
80th percentiles are shown alongside the median of 21 separate runs for each instance size. For 
the limiting case 6x6, the algorithm spent on the order of two hours of (median) computing 
time on a 2,61 GHz AMD processor. 

Even though B&B search is an exact method, it can be used to find approximate solutions 
by running it for a suitable length of time. Figure [5b illustrates how the best solution found 
up to a point develops as increasingly many steps of the algorithm are run. The figure provides 
data on random 2-coloured instances of sizes from 12 x 12 up to 32 x 32. Because we begin our 
search from the initial partition, the best solution at the first step is precisely equal to the 
instance size. For each size, several different patterns were used. The algorithm was cut off after 
10 6 steps. By this time, an approximate reduction of 58% in the size of the tile set was achieved 
(cf. a reduction of 43.5% in [6J]). 
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Fig. 5. (a) Running time of the algorithm (as measured by the number of merge operations) to solve random 
2-coloured near-square-shaped instances of the PATS problem, (b) Evolution of the tile set size of the "current 
best solution" for several large random 2-coloured instances of the PATS problem. 



Next, we consider two well known examples of structured patterns: the discrete Sierpinski 
triangle (part of which was shown in Figure [T]) and the binary counter (see Figure 1 in [12| ) . A 
tile set of size 4 is optimal for both of these patterns. First, for the Sierpinski pattern, we get a 
tile reduction of well over 90% (cf . 45% in [6j ) in Figure Eh.. We used the same cutoff threshold 
and instance sizes as in Figure [5b. Our description of the B&B algorithm leaves some room for 
randomization in deciding which search branch a DFS is to explore next. This randomization 
does not seem to affect the search dramatically when considering the Sierpinski pattern — the 
separate single runs in Figure [6k. are representative of an average randomized run. By contrast, 
for the binary counter pattern, randomized runs for single instance size do make a difference. 
Figure [6b depicts several seperate runs for instance size 32 x 32. Here, each run brings about a 
reduction in solution size that oscillates between a reduction achieved on a random 2-coloured 
instance ([5b) and a reduction achieved on the Sierpinski instance ([6h.). This suggests that, as is 
characteristic of DFS traversal, restarting the algorithm with a different random seed may help 
with large instances that have small optimal solutions. 

5 Conclusion 

We have presented an exact branch-and-bound algorithm for finding minimum-size tile sets that 
self-assemble a given fc-coloured pattern in a uniform self-assembly setting. Simulation results 
indicate that our algorithm is able to find provably minimal tile sets for random instances of 
sizes up to 6 x 6 and can give approximate solutions for larger instances as well. 

One research direction to pursue would be to study tile sets that self-assemble an infinite, 
but finite-period pattern. Does this generalization reduce easily to the finite case? Do there 
exist minimal tile sets that tile the plane aperiodically while still producing a periodic colour 
pattern? 
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Fig. 6. Evolution of the "current best solution" for (a) the Sierpinski pattern and for (b) the binary counter 

pattern. Randomization in the DFS has a clear effect on the performance of the algorithm in the case of the 
binary counter pattern. 
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